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11 Cardinality Rules 



11.1 Counting One Thing by Counting Another 

How do you count the number of people in a crowded room? You could count 
heads, since for each person there is exactly one head. Alternatively, you could 
count ears and divide by two. Of course, you might have to adjust the calculation 
if someone lost an ear in a pirate raid or someone was born with three ears. The 
point here is that you can often count one thing by counting another, though some 
fudge factors may be required. This is a central theme of counting, from the easiest 
problems to the hardest. 

In more formal terms, every counting problem comes down to determining the 
size of some set. The size or cardinality of a finite set S is the number of elements 
in S and it is denoted by \S\. In these terms, we're claiming that we can often find 
the size of one set by finding the size of a related set. We've already seen a general 
statement of this idea in the Mapping Rule of Theorem 7.2. 1. Of particular interest 
here is part 3 of Theorem 7.2.1, where we state that if there is a bijection between 
two sets, then the sets have the same size. This important fact is commonly known 
as the Bijection Rule. 

11.1.1 The Bijection Rule 

Rule 11.1.1 (Bijection Rule). If there is a bijection f : A -> B between A and B, 
then \A\ = \B\. 

The Bijection Rule acts as a magnifier of counting ability; if you figure out the 
size of one set, then you can immediately determine the sizes of many other sets 
via bijections. For example, consider the two sets mentioned at the beginning of 
Part EH: 

A = all ways to select a dozen doughnuts when five varieties are available 
B — all 16-bit sequences with exactly 4 ones 

Let's consider a particular element of set A: 

J)0_ 

chocolate lemon-filled sugar 

We've depicted each doughnut with a and left a gap between the different vari- 
eties. Thus, the selection above contains two chocolate doughnuts, no lemon-filled, 
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six sugar, two glazed, and two plain. Now let's put a 1 into each of the four gaps: 
J) 0^ 1 1 1 JM)^ 1 J) 0_^ 

chocolate lemon-filled sugar glazed plain 

We've just formed a 16-bit number with exactly 4 ones — an element of B ! 

This example suggests a bijection from set A to set B: map a dozen doughnuts 
consisting of: 

c chocolate, / lemon-filled, s sugar, g glazed, and p plain 
to the sequence: 




The resulting sequence always has 16 bits and exactly 4 ones, and thus is an 
element of B. Moreover, the mapping is a bijection; every such bit sequence is 
mapped to by exactly one order of a dozen doughnuts. Therefore, \A\ = \B \ by the 
Bijection Rule! 

This example demonstrates the magnifying power of the bijection rule. We man- 
aged to prove that two very different sets are actually the same size — even though 
we don't know exactly how big either one is. But as soon as we figure out the size 
of one set, we'll immediately know the size of the other. 

This particular bijection might seem frighteningly ingenious if you've not seen 
it before. But you'll use essentially this same argument over and over, and soon 
you'll consider it routine. 



11.2 Counting Sequences 

The Bijection Rule lets us count one thing by counting another. This suggests a 
general strategy: get really good at counting just a. few things and then use bijections 
to count everything else. This is the strategy we'll follow. In particular, we'll get 
really good at counting sequences. When we want to determine the size of some 
other set T, we'll find a bijection from T to a set of sequences S. Then we'll 
use our super- ninja sequence-counting skills to determine which immediately 
gives us \T\. We'll need to hone this idea somewhat as we go along, but that's 
pretty much the plan! 
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11.2. Counting Sequences 

11.2.1 The Product Rule 

The Product Rule gives the size of a product of sets. Recall that if P\, P 2 , . . . , P n 
are sets, then 

P 1 xP 2 x...xP n 

is the set of all sequences whose first term is drawn from Pi , second term is drawn 
from P2 and so forth. 

Rule 11.2.1 (Product Rule). IfP\,Pi, ■ ■ ■ Pn are sets, then: 

\P l xP 2 x...xP n \ = |Pi|-|P 2 |---|P«| 

For example, suppose a daily diet consists of a breakfast selected from set B , a 
lunch from set L, and a dinner from set D where: 

B — {pancakes, bacon and eggs, bagel, Doritos} 

L = {burger and fries, garden salad, Doritos} 

D = {macaroni, pizza, frozen burrito, pasta, Doritos} 

Then B x L x D is the set of all possible daily diets. Here are some sample elements: 

(pancakes, burger and fries, pizza) 
(bacon and eggs, garden salad, pasta) 
(Doritos, Doritos, frozen burrito) 

The Product Rule tells us how many different daily diets are possible: 

\B x L x D\ = \B\ ■ \L\ ■ \D\ 
= 4-3-5 
= 60. 

11.2.2 Subsets of an n-element Set 

How many different subsets of an n-element set X are there? For example, the set 
X = {x\ , X2, X3} has eight different subsets: 

{*i} {xi} {x\,x 2 } 
{x-i} {xi,x 3 } {x 2 ,x 3 } {Xi,X 2 ,X 3 }. 

There is a natural bijection from subsets of X to n-bit sequences. Let x\ , x 2 , . . . , x„ 
be the elements of X. Then a particular subset of X maps to the sequence (b\ , . . . , b n ) 
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where bi = 1 if and only if x\ is in that subset. For example, if n = 10, then the 
subset {x2, X3, X5,xj, x\o} maps to a 10-bit sequence as follows: 

subset: { X2, X3, X5, x-j, X10 } 

sequence: ( 0, 1, 1, 0, 1, 0, 1, 0, 0, 1 ) 

We just used a bijection to transform the original problem into a question about 
sequences — exactly according to plan! Now if we answer the sequence question, 
then we've solved our original problem as well. 

But how many different «-bit sequences are there? For example, there are 8 
different 3 -bit sequences: 

(0,0,0) (0,0,1) (0,1,0) (0,1,1) 

(1,0,0) (1,0,1) (1,1,0) (1,1,1) 

Well, we can write the set of all «-bit sequences as a product of sets: 
{0,l}x{0,l}x...x{0,l} = {0,l} n 

S v ' 

n terms 

Then Product Rule gives the answer: 

l{o,in = |{o,i}| n 

= 2" 

This means that the number of subsets of an « -element set X is also 2". We'll 
put this answer to use shortly. 

11.2.3 The Sum Rule 

Linus allocates his big sister Lucy a quota of 20 crabby days, 40 irritable days, 
and 60 generally surly days. On how many days can Lucy be out-of-sorts one way 
or another? Let set C be her crabby days, / be her irritable days, and S be the 
generally surly. In these terms, the answer to the question is \C U / U S\. Now 
assuming that she is permitted at most one bad quality each day, the size of this 
union of sets is given by the Sum Rule: 

Rule 11.2.2 (Sum Rule). IfA u A 2 ,..., A n are disjoint sets, then: 

\A X UA 2 U ...UA n \ = \Ai\ + \A 2 \ + ... + \A„\ 

Thus, according to Linus' budget, Lucy can be out-of-sorts for: 

\CUIUS\ = \C\ + |/| + |5| 
= 20 + 40 + 60 
= 120 days 
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11.3. The Generalized Product Rule 

Notice that the Sum Rule holds only for a union of disjoint sets. Finding the size 
of a union of intersecting sets is a more complicated problem that we'll take up 
later. 

11.2.4 Counting Passwords 

Few counting problems can be solved with a single rule. More often, a solution 
is a flurry of sums, products, bijections, and other methods. For example, the sum 
and product rules together are useful for solving problems involving passwords, 
telephone numbers, and license plates. For example, on a certain computer system, 
a valid password is a sequence of between six and eight symbols. The first symbol 
must be a letter (which can be lowercase or uppercase), and the remaining symbols 
must be either letters or digits. How many different passwords are possible? 

Let's define two sets, corresponding to valid symbols in the first and subsequent 
positions in the password. 

F = {a,b,...,z,A,B,...,Z} 

S = {a,b,...,z,A,B,...,Z,Q,\,...,9} 

In these terms, the set of all possible passwords is: 1 

(F x S 5 ) U(fx S 6 ) U (F x S 7 ) 

Thus, the length-six passwords are in the set F x S 5 , the length-seven passwords 
are in F x S 6 , and the length-eight passwords are in F x S 7 . Since these sets 
are disjoint, we can apply the Sum Rule and count the total number of possible 
passwords as follows: 

\(F x S 5 ) U(fx S 6 ) U (F x 5 7 )| 

= \F x S 5 \ + \F x 5 6 | + \F x S 7 \ Sum Rule 

= |F| • |5| 5 + \F\ ■ \S\ 6 + \F\ ■ \S\ 7 Product Rule 

= 52 • 62 5 + 52 • 62 6 + 52 • 62 7 
1.8 • 10 14 different passwords. 



11.3 The Generalized Product Rule 

We realize everyone has been working pretty hard this term, and we're considering 
awarding some prizes for truly exceptional coursework. Here are some possible 

lr rhe notation S 5 means 5x5x5x5x5. 
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categories: 

Best Administrative Critique We asserted that the quiz was closed-book. On the 
cover page, one strong candidate for this award wrote, "There is no book." 

Awkward Question Award "Okay, the left sock, right sock, and pants are in an 
antichain, but how — even with assistance — could I put on all three at once?" 

Best Collaboration Statement Inspired by a student who wrote "I worked alone" 
on Quiz 1. 

In how many ways can, say, three different prizes be awarded to n people? This 
is easy to answer using our strategy of translating the problem about awards into 
a problem about sequences. Let P be the set of n people taking the course. Then 
there is a bijection from ways of awarding the three prizes to the set P 3 ::= P x 
P x P. In particular, the assignment: 

"person x wins prize #1, y wins prize #2, and z wins prize #3" 

maps to the sequence (x, y, z). By the Product Rule, we have |P 3 | = |P| 3 = n 3 , 
so there are n 3 ways to award the prizes to a class of n people. 

But what if the three prizes must be awarded to different students? As before, we 
could map the assignment 

"person x wins prize #1, y wins prize #2, and z wins prize #3" 

to the triple (x, y, z) e P 3 . But this function is no longer a bijection. For example, 
no valid assignment maps to the triple (Dave, Dave, Becky) because Dave is not 
allowed to receive two awards. However, there is a bijection from prize assignments 
to the set: 

S = {(x, y, z) e P 3 | x, y, and z are different people} 

This reduces the original problem to a problem of counting sequences. Unfortu- 
nately, the Product Rule is of no help in counting sequences of this type because 
the entries depend on one another; in particular, they must all be different. How- 
ever, a slightly sharper tool does the trick. 

Rule 11.3.1 (Generalized Product Rule). Let S be a set oflength-k sequences. If 
there are: 

• n\ possible first entries, 

• «2 possible second entries for each first entry, 
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• ft 3 possible third entries for each combination of first and second entries, etc. 
then: 

\S\ = U\ -«2 -«3 

In the awards example, S consists of sequences (x, y,z). There are n ways to 
choose x, the recipient of prize #1. For each of these, there are n — 1 ways to choose 
y, the recipient of prize #2, since everyone except for person x is eligible. For each 
combination of x and y, there are n — 2 ways to choose z, the recipient of prize #3, 
because everyone except for x and y is eligible. Thus, according to the Generalized 
Product Rule, there are 

\S\ =n-(n- l).(n-2) 
ways to award the 3 prizes to different people. 

11.3.1 Defective Dollar Bills 

A dollar bill is defective if some digit appears more than once in the 8-digit serial 
number. If you check your wallet, you'll be sad to discover that defective bills 
are all-too-common. In fact, how common are nondefective bills? Assuming that 
the digit portions of serial numbers all occur equally often, we could answer this 
question by computing 

|{serial #'s with all digits different}| 

fraction of nondefective bills = ■ . (11.1) 

| {serial numbers} | 

Let's first consider the denominator. Here there are no restrictions; there are are 10 
possible first digits, 10 possible second digits, 10 third digits, and so on. Thus, the 
total number of 8-digit serial numbers is 10 8 by the Product Rule. 

Next, let's turn to the numerator. Now we're not permitted to use any digit twice. 
So there are still 10 possible first digits, but only 9 possible second digits, 8 possible 
third digits, and so forth. Thus, by the Generalized Product Rule, there are 

10! 

10-9-8-7-6-5-4-3 = — = 1,814,400 

2 

serial numbers with all digits different. Plugging these results into Equation 11.1, 
we find: 

1,814,400 

fraction of nondefective bills = = 1.8144% 

100,000,000 
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abcdefgh abcdefgh 
(a) valid (b) invalid 



Figure 11.1 Two ways of placing a pawn (A), a knight (€Y), and a bishop (A) on 
a chessboard. The configuration shown in (b) is invalid because the bishop and the 
knight are in the same row. 

11.3.2 A Chess Problem 

In how many different ways can we place a pawn (P), a knight (AO, and a bishop 
(fi) on a chessboard so that no two pieces share a row or a column? A valid con- 
figuration is shown in Figure 11.1(a), and an invalid configuration is shown in Fig- 
ure 11.1(b). 

First, we map this problem about chess pieces to a question about sequences. 
There is a bijection from configurations to sequences 

(r P ,cp,r N ,c N ,r B ,CB) 

where rp, rpf, and rg are distinct rows and cp, cn, and cb are distinct columns. 
In particular, is the pawn's row, cp is the pawn's column, is the knight's 
row, etc. Now we can count the number of such sequences using the Generalized 
Product Rule: 

• rp is one of 8 rows 

• cp is one of 8 columns 

• rjy is one of 7 rows (any one but rp) 

• cjy is one of 7 columns (any one but cp) 

• r B is one of 6 rows (any one but rp or r/v) 

• cb is one of 6 columns (any one but cp or cjy ) 

Thus, the total number of configurations is (8 • 7 ■ 6) 2 . 
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11.4. The Division Rule 

11.3.3 Permutations 

A permutation of a set 5* is a sequence that contains every element of 5* exactly 
once. For example, here are all the permutations of the set {a, b, c}: 



How many permutations of an n -element set are there? Well, there are n choices 
for the first element. For each of these, there are n — 1 remaining choices for the 
second element. For every combination of the first two elements, there are n — 2 
ways to choose the third element, and so forth. Thus, there are a total of 



permutations of an n -element set. In particular, this formula says that there are 
3! = 6 permutations of the 3-element set {a, b, c}, which is the number we found 
above. 

Permutations will come up again in this course approximately 1.6 bazillion times. 
In fact, permutations are the reason why factorial comes up so often and why we 
taught you Stirling's approximation: 



Counting ears and dividing by two is a silly way to count the number of people in 
a room, but this approach is representative of a powerful counting principle. 

A k-to-1 function maps exactly k elements of the domain to every element of 
the codomain. For example, the function mapping each ear to its owner is 2-to- 1 . 
Similarly, the function mapping each finger to its owner is 10-to-l, and the function 
mapping each finger and toe to its owner is 20-to-l. The general rule is: 

Rule 11.4.1 (Division Rule). If f : A B is k-to-1, then \A\ = k ■ \B\. 

For example, suppose A is the set of ears in the room and B is the set of people. 
There is a 2-to-l mapping from ears to people, so by the Division Rule, \A\ = 
2- \B\. Equivalently, |S| = \A\/2, expressing what we knew all along: the number 
of people is half the number of ears. Unlikely as it may seem, many counting 
problems are made much easier by initially counting every item multiple times and 
then correcting the answer using the Division Rule. Let's look at some examples. 



(a,b,c) (a,c,b) (b,a,c) 
(b,c,a) (c,a,b) (c,b,a) 



«-(n-l)-(«-2)---3-2-l =n\ 




11.4 The Division Rule 
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abcdefgh abcdefgh 
(a) valid (b) invalid 



Figure 11.2 Two ways to place 2 rooks (H) on a chessboard. The configuration 
in (b) is invalid because the rooks are in the same column. 

11.4.1 Another Chess Problem 

In how many different ways can you place two identical rooks on a chessboard 
so that they do not share a row or column? A valid configuration is shown in 
Figure 11.2(a), and an invalid configuration is shown in Figure 11.2(b). 
Let A be the set of all sequences 

(ri,ci,r 2 ,c 2 ) 

where r\ and r 2 are distinct rows and c\ and c 2 are distinct columns. Let B be the 
set of all valid rook configurations. There is a natural function / from set A to set 
B; in particular, / maps the sequence (r\,c\, r 2 , c 2 ) to a configuration with one 
rook in row r\, column c\ and the other rook in row r 2 , column c 2 . 
But now there's a snag. Consider the sequences: 

(1,1,8,8) and (8,8,1,1) 

The first sequence maps to a configuration with a rook in the lower-left corner and 
a rook in the upper-right corner. The second sequence maps to a configuration with 
a rook in the upper-right corner and a rook in the lower-left corner. The problem is 
that those are two different ways of describing the same configuration! In fact, this 
arrangement is shown in Figure 11.2(a). 

More generally, the function / maps exactly two sequences to every board con- 
figuration; that is / is a 2-to-l function. Thus, by the quotient rule, \A\ = 2- \B\. 
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Rearranging terms gives: 

\A\ (8-7) : 



\B\ 



2 2 

On the second line, we've computed the size of A using the General Product Rule 
just as in the earlier chess problem. 

11.4.2 Knights of the Round Table 

In how many ways can King Arthur seat n different knights at his round table? Two 
seatings are considered equivalent if one can be obtained from the other by rotation. 
For example, the following two arrangements are equivalent: 

k 3 
ki 




Let A be all the permutations of the knights, and let B be the set of all possible 
seating arrangements at the round table. We can map each permutation in set A to a 
circular seating arrangement in set B by seating the first knight in the permutation 
anywhere, putting the second knight to his left, the third knight to the left of the 
second, and so forth all the way around the table. For example: 




This mapping is actually an n-to-1 function from A to B, since all n cyclic shifts of 
the original sequence map to the same seating arrangement. In the example, n = 4 
different sequences map to the same seating arrangement: 



{k 2 ,k A ,k\,k-i) 
{kA,ki,k-i,k 2 ) 
(ki,k 3 ,k 2 ,k 4 ) 
(k 3 ,k 2 ,k 4 ,ki) 
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Therefore, by the division rule, the number of circular seating arrangements is: 

\B\ = ^ = - = (n- 1)! 
n n 

Note that | A \ = n \ since there are n ! permutations of n knights. 



11.5 Counting Subsets 



How many ^-element subsets of an M-element set are there? This question arises 
all the time in various guises: 

• In how many ways can I select 5 books from my collection of 100 to bring 
on vacation? 

• How many different 13-card Bridge hands can be dealt from a 52-card deck? 

• In how many ways can I select 5 toppings for my pizza if there are 14 avail- 
able toppings? 

This number comes up so often that there is a special notation for it: 
:= the number of k -element subsets of an n -element set. 



' n 



The expression [ 1 is read "n choose k." Now we can immediately express the 

w 

answers to all three questions above: 

/ ioo\ 

• I can select 5 books from 100 in I I ways. 

/52\ 

• There are I I different Bridge hands. 

M 

• There are I I different 5-topping pizzas, if 14 toppings are available. 
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11.5. Counting Subsets 

11.5.1 The Subset Rule 

We can derive a simple formula for the «-choose-/c number using the Division Rule. 
We do this by mapping any permutation of an w -element set {a\ , . . . , a n } into a k- 
element subset simply by taking the first k elements of the permutation. That is, 
the permutation a\a2 ...a n will map to the set {a\ , ai, . . . , a^}. 

Notice that any other permutation with the same first k elements a\,...,a^ in 
any order and the same remaining elements n — k elements in any order will also 
map to this set. What's more, a permutation can only map to {a\, ai, . . . , a^} 
if its first k elements are the elements a\,...,ak in some order. Since there are 
k ! possible permutations of the first k elements and (n — k) \ permutations of the 
remaining elements, we conclude from the Product Rule that exactly k\(n — k)\ 
permutations of the n -element set map to the the particular subset, S. In other 
words, the mapping from permutations to k -element subsets is k\(n — fc)!-to-l. 

But we know there are n \ permutations of an n-element set, so by the Division 
Rule, we conclude that 

n\ = k\(n-k)\ 

which proves: 

Rule 11.5.1 (Subset Rule). The number ofk-element subsets of an n-element set is 

(n\ n\ 
kj = kl (n-k)l' 

Notice that this works even for 0-element subsets: n\/0\n\ = 1. Here we use the 
fact that 0! is a product of terms, which by convention 2 equals 1. 

11.5.2 Bit Sequences 

How many n-bit sequences contain exactly k ones? We've already seen the straight- 
forward bijection between subsets of an ^-element set and n-bit sequences. For 
example, here is a 3-element subset of {x\,X2, ■ ■ ■ ,x%} and the associated 8-bit 
sequence: 

{ Xl, X4, X5 } 

( 1, 0, 0, 1, 1, 0, 0, ) 

Notice that this sequence has exactly 3 ones, each corresponding to an element 
of the 3-element subset. More generally, the «-bit sequences corresponding to a 
/:-element subset will have exactly k ones. So by the Bijection Rule, 

2 We don't use it here, but a sum of zero terms equals 0. 
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The number of «-bit sequences with exactly k ones is 



(: 



k 



n 



) 



11.6 Sequences with Repetitions 



11.6.1 Sequences of Subsets 

Choosing a k -element subset of an n -element set is the same as splitting the set 
into a pair of subsets: the first subset of size k and the second subset consisting of 
the remaining n — k elements. So the Subset Rule can be understood as a rule for 
counting the number of such splits into pairs of subsets. 

We can generalize this to splits into more than two subsets. Namely, let A be 
an n-element set and k\, ki, . . . , k m be nonnegative integers whose sum is n. A 
(k\ , ki, . . . , k m )-split of A is a sequence 



where the Aj are disjoint subsets of A and \Af \ = ki for i = 1, . . . , m. 

Rule 11.6.1 (Subset Split Rule). The number of (k\, ki, . . . , k m )-splits of an n- 
element set is 



The proof of this Rule is essentially the same as for the Subset Rule. Namely, we 
map any permutation a \Cii ... a n of an ^-element set A into a (k\, ki, . . . , £ m )-split 
by letting the 1st subset in the split be the first k\ elements of the permutation, the 
2nd subset of the split be the next ki elements, . . . , and the mth subset of the split be 
the final k m elements of the permutation. This map is a k\ ! ki\ ■■■ k m !-to-l func- 
tion from the n \ permutations to the {k\,ki, . . . , /c m )-splits of A, and the Subset 
Split Rule now follows from the Division Rule. 

11.6.2 The Bookkeeper Rule 

We can also generalize our count of n-bit sequences with k ones to counting se- 
quences of n letters over an alphabet with more than two letters. For example, 
how many sequences can be formed by permuting the letters in the 10-letter word 



BOOKKEEPER? 

Notice that there are 1 B, 2 O's, 2 K's, 3 E's, 1 P, and 1 R in BOOKKEEPER. This 
leads to a straightforward bijection between permutations of BOOKKEEPER and 



(A 1 ,A 2 ,...,A m ) 
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(1,2,2,3, l,l)-splits of {1,2,..., 10}. Namely, map a permutation to the sequence 
of sets of positions where each of the different letters occur. 

For example, in the permutation BOOKKEEPER itself, the B is in the 1st posi- 
tion, the O's occur in the 2nd and 3rd positions, K's in 4th and 5th, the E's in the 
6th, 7th and 9th, P in the 8th, and R is in the 10th position. So BOOKKEEPER 
maps to 

({1}, {2, 3}, {4, 5}, {6, 7, 9}, {8}, {10}). 

From this bijection and the Subset Split Rule, we conclude that the number of ways 
to rearrange the letters in the word BOOKKEEPER is: 



B's O's K's E's P's R's 



This example generalizes directly to an exceptionally useful counting principle 
which we will call the 

Rule 11.6.2 (Bookkeeper Rule). Let l\, . . . , l m be distinct elements. The number 
of sequences with k\ occurrences of l\, and ki occurrences of h, ■ ■ ■ , and k m 
occurrences ofl m is 

(k 1 +k 2 + ... + k m )\ 
k\\k 2 \ ... k m \ 

For example, suppose you are planning a 20-mile walk, which should include 5 
northward miles, 5 eastward miles, 5 southward miles, and 5 westward miles. How 
many different walks are possible? 

There is a bijection between such walks and sequences with 5 N's, 5 E's, 5 S's, 
and 5 W's. By the Bookkeeper Rule, the number of such sequences is: 

20! 
5!*" 

11.6.3 The Binomial Theorem 

Counting gives insight into one of the basic theorems of algebra. A binomial is a 
sum of two terms, such as a + b. Now consider its 4th power, (a + b) 4 . 
If we multiply out this 4th power expression completely, we get 

(a + b) A = aaaa + aaab + aaba + aabb 

+ abaa + abab + abba + abbb 

+ baaa + baab + baba + babb 

+ bbaa + bbab + bbba + bbbb 
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Notice that there is one term for every sequence of a's and b's. So there are 2 4 
terms, and the number of terms with k copies of b and n — k copies of a is: 



k\ (n-k)\ 

by the Bookkeeper Rule. Hence, the coefficient of a n ~ k b k is So for n = 4, 
this means: 

a 3 b 1 + fy-a 2 b 2 +^-a 1 b 3 + 

In general, this reasoning gives the Binomial Theorem: 
Theorem 11.6.3 (Binomial Theorem). For all n e N and a,b eR: 

k u k 





(a + bf = \ n k ) an ~ kb 



k=0 



n 



The expression I is often called a "binomial coefficient" in honor of its ap- 

w 

pearance here. 

This reasoning about binomials extends nicely to multinomials, which are sums 
of two or more terms. For example, suppose we wanted the coefficient of 

bo 2 k 2 e 3 pr 

in the expansion of (b + o + k + e + p + r) 10 . Each term in this expansion is a 
product of 10 variables where each variable is one of b, o, k, e, p, or r. Now, the 
coefficient of bo 2 k 2 e 3 pr is the number of those terms with exactly I b, 2 o's, 2 
k's, 3 e's, 1 p, and 1 r. And the number of such terms is precisely the number of 
rearrangements of the word BOOKKEEPER: 

10! 




1! 2! 2! 3! 1! 1! 



The expression on the left is called a "multinomial coefficient." This reasoning 
extends to a general theorem. 

Definition 11.6.4. For n,k\, . . . ,k m e N, such that k\ + ki H V k m = n, define 

the multinomial coefficient 




h\k 2 \ ...k m V 
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11.7. Counting Practice: Poker Hands 



Theorem 11.6.5 (Multinomial Theorem). For all neff, 



(zi + z 2 + • • • + z m ) n = 




m 



You'll be better off remembering the reasoning behind the Multinomial Theorem 
rather than this ugly formal statement. 

11.6.4 A Word about Words 

Someday you might refer to the Subset Split Rule or the Bookkeeper Rule in front 
of a roomful of colleagues and discover that they're all staring back at you blankly. 
This is not because they're dumb, but rather because we made up the name "Book- 
keeper Rule". However, the rule is excellent and the name is apt, so we suggest 
that you play through: "You know? The Bookkeeper Rule? Don't you guys know 
anything???" 

The Bookkeeper Rule is sometimes called the "formula for permutations with 
indistinguishable objects." The size k subsets of an ^-element set are sometimes 
called k -combinations. Other similar-sounding descriptions are "combinations with 
repetition, permutations with repetition, r -permutations, permutations with indis- 
tinguishable objects," and so on. However, the counting rules we've taught you are 
sufficient to solve all these sorts of problems without knowing this jargon, so we 
won't burden you with it. 



Five-Card Draw is a card game in which each player is initially dealt a hand con- 
sisting of 5 cards from a deck of 52 cards. 3 (Then the game gets complicated, but 
let's not worry about that.) The number of different hands in Five-Card Draw is the 

3 There are 52 cards in a standard deck. Each card has a suit and a rank. There are four suits: 



11.7 Counting Practice: Poker Hands 



A (spades) 



V (hearts) 



* (clubs) 



O (diamonds) 



And there are 13 ranks, listed here from lowest to highest: 



Ace Jack Queen King 

4,2,3,4,5,6,7,8,9,/, Q , K 



Thus, for example, 8(? is the 8 of hearts and is the ace of spades. 
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number of 5-element subsets of a 52-element set, which is 




= 2, 598, 960. 



Let's get some counting practice by working out the number of hands with various 
special properties. 

11.7.1 Hands with a Four-of-a-Kind 

A Four-of-a-Kind is a set of four cards with the same rank. How many different 
hands contain a Four-of-a-Kind? Here are a couple examples: 

{8ft, 80, g9, 89, 8*} 
{A*, 2*, 29, 20, 2*} 

As usual, the first step is to map this question to a sequence-counting problem. A 
hand with a Four-of-a-Kind is completely described by a sequence specifying: 

1 . The rank of the four cards. 

2. The rank of the extra card. 

3. The suit of the extra card. 

Thus, there is a bijection between hands with a Four-of-a-Kind and sequences con- 
sisting of two distinct ranks followed by a suit. For example, the three hands above 
are associated with the following sequences: 

(8,2,9) ++{ 84k, 80, 89, 8*, 29} 
(2,4,*) o{2* 29, 20, 2*, A*} 

Now we need only count the sequences. There are 13 ways to choose the first rank, 
12 ways to choose the second rank, and 4 ways to choose the suit. Thus, by the 
Generalized Product Rule, there are 13 • 12 • 4 = 624 hands with a Four-of-a-Kind. 
This means that only 1 hand in about 4165 has a Four-of-a-Kind. Not surprisingly, 
Four-of-a-Kind is considered to be a very good poker hand! 
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11.7.2 Hands with a Full House 

A Full House is a hand with three cards of one rank and two cards of another rank. 
Here are some examples: 

{24k, 2*, 20, /*, JO} 

{50, 5*, 5<?, 7<?, 7*} 

Again, we shift to a problem about sequences. There is a bijection between Full 
Houses and sequences specifying: 

1. The rank of the triple, which can be chosen in 13 ways. 

2. The suits of the triple, which can be selected in ( 3 ) ways. 

3. The rank of the pair, which can be chosen in 12 ways. 

4. The suits of the pair, which can be selected in ( 2 ) ways. 
The example hands correspond to sequences as shown below: 

(2, {ft, ft, O}, /, {*, O}) o {2ft, 2ft, 20, /*, JO} 

(5, {O, ft, 9}, 7, {CP, *}) o {50, 5ft, 5C?, 7C?, 7*} 

By the Generalized Product Rule, the number of Full Houses is: 

We're on a roll — but we're about to hit a speed bump. 

11.7.3 Hands with Two Pairs 

How many hands have Two Pairs; that is, two cards of one rank, two cards of 
another rank, and one card of a third rank? Here are examples: 

{30, 3ft, gO, QV, A*} 
{9C?, 90, 5<?, 5ft, ^ft} 

Each hand with Two Pairs is described by a sequence consisting of: 

1. The rank of the first pair, which can be chosen in 13 ways. 

2. The suits of the first pair, which can be selected ( 2 ) ways. 
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3. The rank of the second pair, which can be chosen in 12 ways. 

4. The suits of the second pair, which can be selected in (2) ways. 

5. The rank of the extra card, which can be chosen in 1 1 ways. 

6. The suit of the extra card, which can be selected in (j) = 4 ways. 
Thus, it might appear that the number of hands with Two Pairs is: 




Wrong answer! The problem is that there is not a bijection from such sequences to 
hands with Two Pairs. This is actually a 2-to-l mapping. For example, here are the 
pairs of sequences that map to the hands given above: 



(3,{o,*},e,{<>,^M,*) \ 

(g, {❖,<?}, 3, {❖,♦}, 4, •) / 

(9, {CP, 5, {<?,»},*,♦) \ 

(5,{C?,*},9,{C?,0},tf,4>) / 



{30, 3*, 20, QV, A*} 



{99, 90, 59, 5*, K*} 



The problem is that nothing distinguishes the first pair from the second. A pair of 
5's and a pair of 9's is the same as a pair of 9's and a pair of 5's. We avoided this 
difficulty in counting Full Houses because, for example, a pair of 6's and a triple of 
kings is different from a pair of kings and a triple of 6's. 

We ran into precisely this difficulty last time, when we went from counting ar- 
rangements of different pieces on a chessboard to counting arrangements of two 
identical rooks. The solution then was to apply the Division Rule, and we can do 
the same here. In this case, the Division rule says there are twice as many sequences 
as hands, so the number of hands with Two Pairs is actually: 



13- g). 12- g). 11- 4 



Another Approach 



The preceding example was disturbing ! One could easily overlook the fact that the 
mapping was 2-to-l on an exam, fail the course, and turn to a life of crime. You 
can make the world a safer place in two ways: 
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1 . Whenever you use a mapping / : A -> B to translate one counting problem 
to another, check that the same number elements in A are mapped to each 
element in B. If A: elements of A map to each of element of B, then apply 
the Division Rule using the constant k. 

2. As an extra check, try solving the same problem in a different way. Multiple 
approaches are often available — and all had better give the same answer! 
(Sometimes different approaches give answers that look different, but turn 
out to be the same after some algebra.) 

We already used the first method; let's try the second. There is a bijection be- 
tween hands with two pairs and sequences that specify: 

1. The ranks of the two pairs, which can be chosen in ways. 

2. The suits of the lower-rank pair, which can be selected in (^) ways. 

3. The suits of the higher-rank pair, which can be selected in (2) ways. 

4. The rank of the extra card, which can be chosen in 1 1 ways. 

5. The suit of the extra card, which can be selected in (j) = 4 ways. 
For example, the following sequences and hands correspond: 

({3, Q}, {O, 4k}, {O, <?}, A, 4k) o {30, 3*. QO, QV, A*} 
({9, 5}, {CP, *}, {(?, O}, K, 4k) o {9®, 90, 5<P, 54k, K*} 

Thus, the number of hands with two pairs is: 




This is the same answer we got before, though in a slightly different form. 
11.7.4 Hands with Every Suit 

How many hands contain at least one card from every suit? Here is an example of 
such a hand: 

{70, ^4k, 30, AV, 24k} 

Each such hand is described by a sequence that specifies: 

1. The ranks of the diamond, the club, the heart, and the spade, which can be 
selected in 13 • 13 • 13 • 13 = 13 4 ways. 
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2. The suit of the extra card, which can be selected in 4 ways. 

3. The rank of the extra card, which can be selected in 12 ways. 
For example, the hand above is described by the sequence: 

(7, K, A, 2, 0, 3) o {70, K*, AV, 2*, 30}. 

Are there other sequences that correspond to the same hand? There is one more! 
We could equally well regard either the 30 or the 70 as the extra card, so this 
is actually a 2-to-l mapping. Here are the two sequences corresponding to the 
example hand: 

(7,K,A,2,0,3) \ 

{70, AV, 24k, 30} 

(3,7^,^,2,0,7) / 
Therefore, the number of hands with every suit is: 

13 4 - 4 - 12 

2 ' 



11.8 Inclusion-Exclusion 

How big is a union of sets? For example, suppose there are 60 math majors, 200 
EECS majors, and 40 physics majors. How many students are there in these three 
departments? Let M be the set of math majors, E be the set of EECS majors, and 
P be the set of physics majors. In these terms, we're asking for \M U E U P\. 
The Sum Rule says that if M, E, and P are disjoint, then the sum of their sizes 

is 

\M U E U P\ = \M\ + \E\ + \P\. 

However, the sets M, E, and P might not be disjoint. For example, there might 
be a student majoring in both math and physics. Such a student would be counted 
twice on the right side of this equation, once as an element of M and once as an 
element of P. Worse, there might be a triple-major 4 counted three times on the 
right side! 

Our most-complicated counting rule determines the size of a union of sets that 
are not necessarily disjoint. Before we state the rule, let's build some intuition by 
considering some easier special cases: unions of just two or three sets. 

4 . . . though not at MIT anymore. 
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11.8.1 Union of Two Sets 

For two sets, Si and 5*2, the Inclusion-Exclusion Rule is that the size of their union 
is: 

|Si us 2 | = |Si| + |s 2 |-|Si ns 2 | (1L2) 

Intuitively, each element of Si is accounted for in the first term, and each element 
of S 2 is accounted for in the second term. Elements in both S\ and S2 are counted 
twice — once in the first term and once in the second. This double-counting is cor- 
rected by the final term. 



11.8.2 Union of Three Sets 

So how many students are there in the 
other words, what is \M U E U P | if: 



math, EECS, and physics departments? In 



\M\ = 60 
\E\ = 200 
|P| = 40. 

The size of a union of three sets is given by a more complicated Inclusion-Exclusion 
formula: 

|SiU5 2 U5 3 | = |5il + |S2| + |5 3 | 

-|5i nSzl-ISi ns 3 |-|s 2 ns 3 | 
+ |Sins 2 ns 3 |. 

Remarkably, the expression on the right accounts for each element in the union of 
S\, S2, and S3 exactly once. For example, suppose that x is an element of all three 
sets. Then x is counted three times (by the \S\\, |5 2 |, and |S 3 | terms), subtracted 
off three times (by the \Si fl 5 2 |, \ Si fl 5 3 |, and |5 2 D S 3 | terms), and then counted 
once more (by the \Si fl 5 2 fl 5 3 | term). The net effect is that x is counted just 
once. 

If x is in two sets (say, Si and S2), then x is counted twice (by the \S\ | and |5 2 | 
terms) and subtracted once (by the |5i fl 5 2 | term). In this case, x does not factor 
into any of the other terms, since x ^ S3. 

So we can't answer the original question without knowing the sizes of the various 
intersections. Let's suppose that there are: 

4 math - EECS double majors 

3 math - physics double majors 

1 1 EECS - physics double majors 

2 triple majors 
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Then|Mn£| = 4 + 2, \MDP\ = 3+2, \EDP\ = 11 + 2, and \MnEC\P\ = 2. 
Plugging all this into the formula gives: 

\M u E u P\ = \M\ + \E\ + |P| - \M n E\ - \M n P\ - \E n P \ + \M n E n P 

= 60 + 200 + 40- 6-5- 13 + 2 

= 278 

11.8.3 Sequences with 42, 04, or 60 

In how many permutations of the set {0, 1, 2, ... , 9} do either 4 and 2, and 4, or 
6 and appear consecutively? For example, none of these pairs appears in: 

(7,2,9,5,4,1,3,8,0,6). 

The 06 at the end doesn't count; we need 60. On the other hand, both 04 and 60 
appear consecutively in this permutation: 

(7,2,5,6,0,4,3,8,1,9). 

Let P42 be the set of all permutations in which 42 appears. Define Peo and P04 
similarly. Thus, for example, the permutation above is contained in both P^o and 
P04, but not P42. In these terms, we're looking for the size of the set P42 U P04 U 
Peo- 

First, we must determine the sizes of the individual sets, such as P^o- We can 
use a trick: group the 6 and together as a single symbol. Then there is a natural 
bijection between permutations of {0, 1, 2, . . . 9} containing 6 and consecutively 
and permutations of: 

{60,1,2,3,4,5,7,8,9}. 
For example, the following two sequences correspond: 

(7,2,5,6,0,4,3,8,1,9) (7,2,5,60,4,3,8,1,9). 

There are 9! permutations of the set containing 60, so |P6o| = 9! by the Bijection 
Rule. Similarly, |Po4| = I-P42I = 9! as well. 

Next, we must determine the sizes of the two-way intersections, such as P42 n 
P60- Using the grouping trick again, there is a bijection with permutations of the 
set: 

{42,60, 1,3,5,7,8,9}. 
Thus, IP42 n ^60 1 = 8!. Similarly, |P60 H PmI = 8! by a bijection with the set: 

{604, 1,2,3,5,7,8,9}. 
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And I P42 n P04 1 = 8 ! as well by a similar argument. Finally, note that | P^o H 
P04 H P42I = 7! by a bijection with the set: 

{6042, 1,3,5,7,8,9}. 
Plugging all this into the formula gives: 

\P 42 U P 4 U P 6 o| = 9! + 9! + 9! - 8! - 8! - 8! + 7!. 
11.8.4 Union of n Sets 

The size of a union of n sets is given by the following rule. 
Rule 11.8.1 (Inclusion-Exclusion). 

I Si u s 2 u ••• u s n \ = 

the sum of the sizes of the individual sets 
minus the sizes of all two-way intersections 

plus the sizes of all three-way intersections 
minus the sizes of all four-way intersections 

plus the sizes of all five-way intersections, etc. 

The formulas for unions of two and three sets are special cases of this general 
rule. 

This way of expressing Inclusion-Exclusion is easy to understand and nearly 
as precise as expressing it in mathematical symbols, but we'll need the symbolic 
version below, so let's work on deciphering it now. 

We already have a standard notation for the sum of sizes of the individual sets, 
namely, 

n 

Ei* 1. 

i = i 

A "two-way intersection" is a set of the form S,- D Sj for i ^ j . We regard Sj D S; 
as the same two-way intersection as Si D Sj , so we can assume that i < j . Now 
we can express the sum of the sizes of the two-way intersections as 

E \StnSj\. 

l<i< j<n 

Similarly, the sum of the sizes of the three-way intersections is 

\Si n Sj n s k \. 

l<i<j<k<n 
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These sums have alternating signs in the Inclusion-Exclusion formula, with the 
sum of the fc-way intersections getting the sign (— \) k ~ l . This finally leads to a 
symbolic version of the rule: 

Rule (Inclusion-Exclusion). 

n n 

il>i=Ei*i 

i=l i=l 

~ E \ S i nS j\ 
l<i<j<n 

+ £ \SinSjns k \ + — 

1 <;' < j<k<n 

+ (-l)»- 1 |f)S i |. 

i = \ 

11.8.5 Computing Euler's Function 

As an example, let's use Inclusion-Exclusion to calculate Euler's function, </>(«). 
By definition, <p(n) is the number of nonnegative integers less than a positive inte- 
ger n that are relatively prime to n. But the set S of nonnegative integers less than n 
that are not relatively prime to n will be easier to count. 

Suppose the prime factorization of n is ' ' ' Pm for distinct primes p(. This 
means that the integers in S are precisely the nonnegative integers less than n that 
are divisible by at least one of the pt 's. Letting Q be the set of nonnegative integers 
less than n that are divisible by pi , we have 

m 

S = (jQ. 

i = l 

We'll be able to find the size of this union using Inclusion-Exclusion because 
the intersections of the Q 's are easy to count. For example, C\ fl C2 D C3 is the 
set of nonnegative integers less than n that are divisible by each of p\, pi and p^. 
But since the pi 's are distinct primes, being divisible by each of these primes is the 
same as being divisible by their product. Now observe that if r is a positive divisor 
of n, then exactly n/r nonnegative integers less than n are divisible by r, namely, 
0, r,2r, ((n/r) — l)r. So exactly n/ p\ P2P3 nonnegative integers less than n 
are divisible by all three primes p\, P2, Ps- In other words, 

Id nc 2 nc 3 | = — - — . 

P1P2P3 
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Reasoning this way about all the intersections among the Q's and applying 
Inclusion-Exclusion, we get 



\S\ = \{jQ\ 



i = l 
m 



= X)IQ|- E l c ' nc 7l+ E \c i nc J nc k \---- + (-i) m - 1 \f]c i \ 

i = l 1 <; < j<m l<i<j<k<m i = l 

m 

= y!L_ y ^+ y + (-iy"- 1 — " 

= HE 1 - E — + E — + (-D m - 1 — 1 

But = « — |5| by definition, so 

Hn) = n(i-jr-+ e — - E - 1 — +-+(-ir 1 



m 



-n (>-£)■ u.3, 

Yikes ! That was pretty hairy. Are you getting tired of all that nasty algebra? If 
so, then good news is on the way. In the next section, we will show you how to 
prove some heavy-duty formulas without using any algebra at all. Just a few words 
and you are done. No kidding. 



11.9 Combinatorial Proofs 

Suppose you have n different T-shirts, but only want to keep k. You could equally 
well select the k shirts you want to keep or select the complementary set of n — k 
shirts you want to throw out. Thus, the number of ways to select k shirts from 
among n must be equal to the number of ways to select n — k shirts from among n. 
Therefore: 
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This is easy to prove algebraically, since both sides are equal to: 



k\ (n-k)V 



But we didn't really have to resort to algebra; we just used counting principles. 
Hmmm 

11.9.1 Pascal's Identity 

Jay, famed Math for Computer Science Teaching Assistant, has decided to try out 
for the US Olympic boxing team. After all, he's watched all of the Rocky movies 
and spent hours in front of a mirror sneering, "Yo, you wanna piece a' me?!" Jay 
figures that n people (including himself) are competing for spots on the team and 
only k will be selected. As part of maneuvering for a spot on the team, he needs to 
work out how many different teams are possible. There are two cases to consider: 

• Jay is selected for the team, and his k — 1 teammates are selected from among 
the other n— 1 competitors. The number of different teams that can be formed 
in this way is: 



among the other n — 1 competitors. The number of teams that can be formed 
this way is: 



All teams of the first type contain Jay, and no team of the second type does; 
therefore, the two sets of teams are disjoint. Thus, by the Sum Rule, the total 
number of possible Olympic boxing teams is: 



Jeremy, equally-famed Teaching Assistant, thinks Jay isn't so tough and so he 
might as well also try out. He reasons that n people (including himself) are trying 
out for k spots. Thus, the number of ways to select the team is simply: 




• Jay is not selected for the team, and all k team members are selected from 
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Jeremy and Jay each correctly counted the number of possible boxing teams. 
Thus, their answers must be equal. So we know: 




This is called Pascal's Identity. And we proved it without any algebra! Instead, we 
relied purely on counting techniques. 

11.9.2 Finding a Combinatorial Proof 

A combinatorial proof is an argument that establishes an algebraic fact by relying 
on counting principles. Many such proofs follow the same basic outline: 

1. Define a set S. 

2. Show that \ S\ = n by counting one way. 

3. Show that |5| = m by counting another way. 

4. Conclude that n = m. 

In the preceding example, S was the set of all possible Olympic boxing teams. Jay 
computed 

\S\ ' * ? " ' 

by counting one way, and Jeremy computed 

by counting another way. Equating these two expressions gave Pascal's Identity. 

More typically, the set 5* is defined in terms of simple sequences or sets rather 
than an elaborate story. Here is a less colorful example of a combinatorial argu- 
ment. 

Theorem 11.9.1. 

2n 




£ " L-r 



r=0 
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Proof. We give a combinatorial proof. Let S be all n-card hands that can be dealt 
from a deck containing n red cards (numbered 1, . . . , n) and 2n black cards (num- 
bered 1, . . . , In). First, note that every 3«-element set has 



m -element subsets. 

From another perspective, the number of hands with exactly r red cards is 




since there are (") ways to choose the r red cards and (^" r ) ways to choose the 
n — r black cards. Since the number of red cards can be anywhere from to n, the 
total number of n-card hands is: 




Equating these two expressions for | S | proves the theorem. ■ 

Combinatorial proofs are almost magical. Theorem 1 1.9.1 looks pretty scary, but 
we proved it without any algebraic manipulations at all. The key to constructing 
a combinatorial proof is choosing the set S properly, which can be tricky. Gener- 
ally, the simpler side of the equation should provide some guidance. For example, 
the right side of Theorem 11.9.1 is ( 3 "), which suggests that it will be helpful to 
choose S to be all n -element subsets of some 3n -element set. 



11.10 The Pigeonhole Principle 

Here is an old puzzle: 

A drawer in a dark room contains red socks, green socks, and blue 
socks. How many socks must you withdraw to be sure that you have a 
matching pair? 

For example, picking out three socks is not enough; you might end up with one 
red, one green, and one blue. The solution relies on the Pigeonhole Principle, 
which is a friendly name for the contrapositive of the injective case of the Mapping 
Rule. 
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A f B 
1 st sock • red 




Figure 11.3 One possible mapping of four socks to three colors. 



Rule 11.10.1 (Pigeonhole Principle). If\X\ > \Y\, then for every total function 5 
f : X Y, there exist two different elements of X that are mapped to the same 
element ofY. 

What this abstract mathematical statement has to do with selecting footwear un- 
der poor lighting conditions is maybe not obvious. However, let A be the set of 
socks you pick out, let B be the set of colors available, and let / map each sock 
to its color. The Pigeonhole Principle says that if \A\ > \B\ = 3, then at least two 
elements of A (that is, at least two socks) must be mapped to the same element of 
B (that is, the same color). Therefore, four socks are enough to ensure a matched 
pair. For example, one possible mapping of four socks to three colors is shown in 
Figure 11.3. 

Not surprisingly, the pigeonhole principle is often described in terms of pigeons: 

If there are more pigeons than holes they occupy, then at least two 
pigeons must be in the same hole. 

In this case, the pigeons form set A, the pigeonholes are set B, and / describes 
which hole each pigeon flies into. 

Mathematicians have come up with many ingenious applications for the pigeon- 
hole principle. If there were a cookbook procedure for generating such arguments, 
we'd give it to you. Unfortunately, there isn't one. One helpful tip, though: when 
you try to solve a problem with the pigeonhole principle, the key is to clearly iden- 
tify three things: 

5 This Mapping Rule applies even if / is a total injective relation. Recall that a function is total if 
Vx € X 3y € Y. f{x) = y. 
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1. The set A (the pigeons). 

2. The set B (the pigeonholes). 

3. The function / (the rule for assigning pigeons to pigeonholes). 

11.10.1 Hairs on Heads 

There are a number of generalizations of the pigeonhole principle. For example: 

Rule 11.10.2 (Generalized Pigeonhole Principle). If \X\ > k • \ Y\, then every total 
function f : X -> Y maps at least k + 1 different elements of X to the same 
element ofY. 

For example, if you pick two people at random, surely they are extremely un- 
likely to have exactly the same number of hairs on their heads. However, in the 
remarkable city of Boston, Massachusetts there are actually three people who have 
exactly the same number of hairs! Of course, there are many bald people in Boston, 
and they all have zero hairs. But we're talking about non-bald people; say a person 
is non-bald if they have at least ten thousand hairs on their head. 

Boston has about 500,000 non-bald people, and the number of hairs on a person's 
head is at most 200,000. Let A be the set of non-bald people in Boston, let B = 
{10, 000, 10, 001, . . . , 200, 000}, and let / map a person to the number of hairs on 
his or her head. Since \A\ > 2\B\, the Generalized Pigeonhole Principle implies 
that at least three people have exactly the same number of hairs. We don't know 
who they are, but we know they exist! 

11.10.2 Subsets with the Same Sum 

For your reading pleasure, we have displayed ninety 25-digit numbers in Fig- 
ure 11.4. Are there two different subsets of these 25-digit numbers that have the 
same sum? For example, maybe the sum of the last ten numbers in the first column 
is equal to the sum of the first eleven numbers in the second column? 

Finding two subsets with the same sum may seem like a silly puzzle, but solving 
these sorts of problems turns out to be useful in diverse applications such as finding 
good ways to fit packages into shipping containers and decoding secret messages. 

It turns out that it is hard to find different subsets with the same sum, which 
is why this problem arises in cryptography. But it is easy to prove that two such 
subsets exist. That's where the Pigeonhole Principle comes in. 

Let A be the collection of all subsets of the 90 numbers in the list. Now the sum 
of any subset of numbers is at most 90 • 10 25 , since there are only 90 numbers and 
every 25-digit number is less than 10 25 . So let B be the set of integers {0, 1 , . . . , 90- 
10 25 }, and let / map each subset of numbers (in A) to its sum (in B). 
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Figure 11.4 Ninety 25-digit numbers. Can you find two different subsets of these 


numbers that have the same sum? 
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We proved that an m -element set has 2" different subsets in Section 1 1.2. There- 
fore: 

\A\ = 2 90 > 1.237 x 10 27 

On the other hand: 

\B\ = 90- 10 25 + 1 < 0.901 x 10 27 . 

Both quantities are enormous, but \A\ is a bit greater than \B\. This means that / 
maps at least two elements of A to the same element of B. In other words, by the 
Pigeonhole Principle, two different subsets must have the same sum! 

Notice that this proof gives no indication which two sets of numbers have the 
same sum. This frustrating variety of argument is called a nonconstructive proof. 
To see if was possible to actually find two different subsets of the ninety 25-digit 
numbers with the same sum, we offered a $100 prize to the first student who did it. 
We didn't expect to have to pay off this bet, but we underestimated the ingenuity 
and initiative of the students. One computer science major wrote a program that 
cleverly searched only among a reasonably small set of "plausible" sets, sorted 
them by their sums, and actually found a couple with the same sum. He won the 
prize. A few days later, a math major figured out how to reformulate the sum 
problem as a "lattice basis reduction" problem; then he found a software package 
implementing an efficient basis reduction procedure, and using it, he very quickly 
found lots of pairs of subsets with the same sum. He didn't win the prize, but he 
got a standing ovation from the class — staff included. 



11.11 A Magic Trick 

There is a Magician and an Assistant. The Assistant goes into the audience with a 
deck of 52 cards while the Magician looks away. 

Five audience members each select one card from the deck. The Assistant then 
gathers up the five cards and holds up four of them so the Magician can see them. 
The Magician concentrates for a short time and then correctly names the secret, 
fifth card! 

Since we don't really believe the Magician can read minds, we know the As- 
sistant has somehow communicated the secret card to the Magician. Since real 
Magicians and Assistants are not to be trusted, we can expect that the Assistant 
would illegitimately signal the Magician with coded phrases or body language, but 
they don't have to cheat in this way. In fact, the Magician and Assistant could be 



34 



"mcs-ftl" — 2010/9/8 — 0:40 — page 347 — #353 



11.11. A Magic Trick 

Sets with Distinct Subset Sums 

How can we construct a set of n positive integers such that all its subsets have 
distinct sums? One way is to use powers of two: 

{1,2,4,8,16} 

This approach is so natural that one suspects all other such sets must involve larger 
numbers. (For example, we could safely replace 16 by 17, but not by 15.) Remark- 
ably, there are examples involving smaller numbers. Here is one: 

{6,9,11,12,13} 

One of the top mathematicians of the Twentieth Century, Paul Erdos, conjectured in 
1931 that there are no such sets involving significantly smaller numbers. More pre- 
cisely, he conjectured that the largest number in such a set must be greater than c2 n 
for some constant c > 0. He offered $500 to anyone who could prove or disprove 
his conjecture, but the problem remains unsolved. 
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kept out of sight of each other while some audience member holds up the 4 cards 
designated by the Assistant for the Magician to see. 

Of course, without cheating, there is still an obvious way the Assistant can com- 
municate to the Magician: he can choose any of the 4! = 24 permutations of the 
4 cards as the order in which to hold up the cards. However, this alone won't 
quite work: there are 48 cards remaining in the deck, so the Assistant doesn't have 
enough choices of orders to indicate exactly what the secret card is (though he 
could narrow it down to two cards). 

11.11.1 The Secret 

The method the Assistant can use to communicate the fifth card exactly is a nice 
application of what we know about counting and matching. 

The Assistant has a second legitimate way to communicate: he can choose which 
of the five cards to keep hidden. Of course, it's not clear how the Magician could 
determine which of these five possibilities the Assistant selected by looking at the 
four visible cards, but there is a way, as we'll now explain. 

The problem facing the Magician and Assistant is actually a bipartite matching 
problem. Put all the sets of 5 cards in a collection X on the left. And put all the 
sequences of 4 distinct cards in a collection Y on the right. These are the two sets 
of vertices in the bipartite graph. There is an edge between a set of 5 cards and 
a sequence of 4 if every card in the sequence is also in the set. In other words, if 
the audience selects a set of 5 cards, then the Assistant must reveal a sequence of 
4 cards that is adjacent in the bipartite graph. Some edges are shown in the diagram 
in Figure 11.5. 

For example, 

{8V,K*,Q*,20,60} (11.4) 

is an element of X on the left. If the audience selects this set of 5 cards, then 
there are many different 4-card sequences on the right in set Y that the Assis- 
tant could choose to reveal, including (89, K*, Q*, 20), (K0>, 89, g*, 20), and 
89, 60, £>*)• 

What the Magician and his Assistant need to perform the trick is a matching for 
the X vertices. If they agree in advance on some matching, then when the audience 
selects a set of 5 cards, the Assistant reveals the matching sequence of 4 cards. The 
Magician uses the matching to find the audience's chosen set of 5 cards, and so he 
can name the one not already revealed. 

For example, suppose the Assistant and Magician agree on a matching containing 
the two bold edges in Figure 1 1.5. If the audience selects the set 

{89,^*,g*,9*,60}, (11.5) 
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x=all 
sets of 
5 cards 



{8C?,tf*,g*,20,6<>} 
{8V,m,Q+,9+,60} 



Figure 11.5 The bipartite graph where the nodes on the left correspond to sets 
of 5 cards and the nodes on the right correspond to sequences of 4 cards. There is 
an edge between a set and a sequence whenever all the cards in the sequence are 
contained in the set. 

then the Assistant reveals the corresponding sequence 

(K*,8V,60,Q*). (11.6) 

Using the matching, the Magician sees that the hand (11.5) is matched to the se- 
quence (11.6), so he can name the one card in the corresponding set not already 
revealed, namely, the 94k. Notice that the fact that the sets are matched, that is, 
that different sets are paired with distinct sequences, is essential. For example, if 
the audience picked the previous hand (1 1.4), it would be possible for the Assistant 
to reveal the same sequence (11.6), but he better not do that; if he did, then the 
Magician would have no way to tell if the remaining card was the 9Jfr or the 20. 

So how can we be sure the needed matching can be found? The answer is that 
each vertex on the left has degree 5-4! = 120, since there are five ways to select 
the card kept secret and there are 4! permutations of the remaining 4 cards. In 
addition, each vertex on the right has degree 48, since there are 48 possibilities for 
the fifth card. So this graph is degree-constrained according to Definition 5.2.6, 
and therefore satisfies Hall's matching condition. 

In fact, this reasoning shows that the Magician could still pull off the trick if 120 
cards were left instead of 48, that is, the trick would work with a deck as large as 
124 different cards — without any magic! 
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A 

K 2 
Q 3 

/ 4 

10 5 

9 6 
8 7 

Figure 11.6 The 13 card ranks arranged in cyclic order. 
11.11.2 The Real Secret 

But wait a minute! It's all very well in principle to have the Magician and his 
Assistant agree on a matching, but how are they supposed to remember a matching 
with ( 5 5 2 ) = 2, 598, 960 edges? For the trick to work in practice, there has to be a 
way to match hands and card sequences mentally and on the fly. 

We'll describe one approach. As a running example, suppose that the audience 
selects: 

100? 90 30? JO. 

• The Assistant picks out two cards of the same suit. In the example, the 
assistant might choose the 30? and 100?. This is always possible because of 
the Pigeonhole Principle — there are five cards and 4 suits so two cards must 
be in the same suit. 

• The Assistant locates the ranks of these two cards on the cycle shown in Fig- 
ure 11.6. For any two distinct ranks on this cycle, one is always between 1 
and 6 hops clockwise from the other. For example, the 30? is 6 hops clock- 
wise from the 100?. 

• The more counterclockwise of these two cards is revealed first, and the other 
becomes the secret card. Thus, in our example, the 100? would be revealed, 
and the 30? would be the secret card. Therefore: 

- The suit of the secret card is the same as the suit of the first card re- 
vealed. 
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- The rank of the secret card is between 1 and 6 hops clockwise from the 
rank of the first card revealed. 

• All that remains is to communicate a number between 1 and 6. The Magician 
and Assistant agree beforehand on an ordering of all the cards in the deck 
from smallest to largest such as: 

A* AO AV A* 2* 20 29 2* . . . KV K* 

The order in which the last three cards are revealed communicates the num- 
ber according to the following scheme: 



( small, 


medium, 


large ) 


= 1 


( small, 


large, 


medium ) 


= 2 


( medium, 


small, 


large ) 


= 3 


( medium, 


large, 


small ) 


= 4 


( large, 


small, 


medium ) 


= 5 


( large, 


medium, 


small ) 


= 6 



In the example, the Assistant wants to send 6 and so reveals the remaining 
three cards in large, medium, small order. Here is the complete sequence that 
the Magician sees: 

109 JO 90 

• The Magician starts with the first card, 109, and hops 6 ranks clockwise to 
reach 39, which is the secret card! 

So that's how the trick can work with a standard deck of 52 cards. On the other 
hand, Hall's Theorem implies that the Magician and Assistant can in principle per- 
form the trick with a deck of up to 124 cards. It turns out that there is a method 
which they could actually learn to use with a reasonable amount of practice for a 
124-card deck, but we won't explain it here. 6 

11.11.3 The Same Trick with Four Cards? 

Suppose that the audience selects only four cards and the Assistant reveals a se- 
quence of three to the Magician. Can the Magician determine the fourth card? 

Let X be all the sets of four cards that the audience might select, and let Y be all 
the sequences of three cards that the Assistant might reveal. Now, on one hand, we 
have 




= 270, 725 



6 See The Best Card Trick by Michael Kleber for more information. 
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by the Subset Rule. On the other hand, we have 

\Y\ = 52-51 -50 = 132,600 

by the Generalized Product Rule. Thus, by the Pigeonhole Principle, the Assistant 
must reveal the same sequence of three cards for at least 

"270,7251 _ 
132,600 

different four-card hands. This is bad news for the Magician: if he sees that se- 
quence of three, then there are at least three possibilities for the fourth card which 
he cannot distinguish. So there is no legitimate way for the Assistant to communi- 
cate exactly what the fourth card is ! 

11.11.4 Never Say Never 

No sooner than we finished proving that the Magician can't pull off the trick with 
four cards instead of five, a student showed us a way that it might be doable after 
all. The idea is to place the three cards on a table one at a time instead of revealing 
them all at once. This provides the Magician with two completely independent 
sequences of three cards: one for the temporal order in which the cards are placed 
on the table, and one for the spatial order in which they appear once placed. 
For example, suppose the audience selects 

109 90 39 

and the assistant decides to reveal 

109 90 Q*. 

The assistant might decide to reveal the g4k first, the 109 second, and the 90 third, 
thereby production the temporal sequence 

(fi*. 109, 90). 

If the is placed in the middle position on the table, the 109 is placed in the 
rightmost position on the table, and the 90 is placed in the leftmost position on the 
table, the spatial sequence would be 

(90, Q*, 109). 

In this version of the card trick, X consists of all sets of 4 cards and Y consists 
of all pairs of sequences of the same 3 cards. As before, we can create a bipartite 
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graph where an edge connects a set S of 4 cards in X with a pair of sequences in Y 
if the 3 cards in the sequences are in S. 
The degree of every node in X is then 

4 -3! -3! = 144 

since there are 4 choices for which card is not revealed and 3! orders for each 
sequence in the pair. 

The degree of every node in Y is 49 since there are 52 — 3 = 49 possible choices 
for the 4th card. Since 144 > 49, we can use Hall's Theorem to establish the 
existing of a matching for X. 

Hence, the magic trick is doable with 4 cards — the assistant just has to convey 
more information. Can you figure out a convenient way to pull off the trick on the 
fly? 

So what about the 3-card version? Surely that is not doable.. . . 
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